SEQUENCE LISTING 



(1> GENERAL INFORMATION: 

(i) APPLICANT: Mann, Michael 3. 

Heche, Randy I. 

(ii) TITLE OF INVENTION: OB FUSION PROTEIN COMPOSITIONS AND 
METHODS 

(iii) NUMBER OF SEQUENCES: 13 

(iv) CORRESPONDENCE ADDRESS: _ 

(A) ADDRESSEE: Amgen Inc. 

(B) -STREET: 1840 DeHavilland Drive 

(C) CITY: Thousand Oaks 
■(D) STATE: CA 

( E) COUNTRY: USA 

(F) ZIP: 91320-1789 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.30 . 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/770,973 

(B) FILING DATE: 20-DEC-1996 

(C) CLASSIFICATION: 

(viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: Knight, Matthew W. 

(B) REGISTRATION NUMBER: 3 6,846 

(C) REFERENCE/ DOCKET NUMBER: A- 4 16 



2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 491 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION: 41 

(D) OTHER INFORMATION: /note* "Met = ATG" 



- 50 - 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
.TCT^x™ AG7TTTAACT TTTAGAAGGA GGAATAACAT ATGGTACCGA TCGAGAAAGT 
TGAGGACGAG ACCAAAACCT .AAXTAAAAC GATCGTTACG CGTATCAACG ACATCAGTCA 
GAGCGAGTCG GTCTCCGCTA AAGAGCGTGT TACGGGTGTG GACTTCATCC CGGGTCTGCA 
CCCGATCCTA AGCTTGTCCA AAATGGAGCA GACCCTGGCT GTATACCAGC AGGTGTTAAC 
CTCCCTGCCG TCCCAGAACG TTCTTCAGAT GGGTAAGGAG CTCGAGAACC TTCGCGACCT 
GGTGCAGGTG CTGGCATTCT CGAAATGGTG GTCCCTGCCG CAGACCTCAG GTCTTCAGAA 
ACCGGAATCC GTGGAGGGGG TCCTGGAAGC ATCCCTGTAC AGCACCGAAG TTOTTGGTGT 
GTGCGGTGTG CAGGGTTCCC TTCAGGACAT CCTTCAGCAG CTGGACGTTT CTCCGGAATG 
TTAATGGATC C 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH:' 491 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNE5S : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: • cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 
AGATCTAAAC TCAAAATTGA AAATCTTCCT CCTTATTGTA TACCATGGCT AGGTCTTTCA 
AGTCCTGCTG TGGTTTTGGA ATTAATTTTG CTAGCAATGC GCATAGTTGC TGTAGTCAGT 
GTGGGTCAGC CAGAGGCGAT TTGTCGCACA ATGGCCAGAC CTCAAGTAGG GCCCAGACGT 
GGGCTAGGAT -TCGAACAGGT TTTACCTGGT CTGGGACCGA CATATGGTCG TCCACAATTG 
GAGGGACGGC AGGGTCTTGC AAGAAGTCTA GCGATTGCTG GAGCTCTTGG AAGCGCTGGA 
CGACGTGGAC GACCGTAAGA GGTTTAGGAC GAGGGACGGC GTCTGGAGTC CAGAAGTCTT 
TGGCCTTAGG GACCTGCCCC AGGACCTTCG TAGGGACATG TCGTGGCTTC AACAACGAGA 
CAGGGCAGAC G TC C C AAGGG AAGTCCTGTA GGAAGTCGTC GACCTGCAAA GAGGCCTTAC 
AATTACCTAG G 



50 
L20 
130 

240 

300 

360 

420 

480 . 

491 



60 
120 
180 
240 
300 
360 
420 
480 
491 
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!) INFORMATION ?OR 5£Q ID MO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 147 amino acids 
(3) TYPE: amino acid 

(C) 3TRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/ KEY: Protein 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /notes "Met (ATG) starts at -1* 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Met Val Pro lie Gin Lys Val Gin Asp Asp Thr Lys Thr Leu lie Lys 
1 5 10 15 

Thr lie Val Thr Arg lie Asn Asp lie Ser His Thr Gin Ser Val Ser 

20 25 . 30 

Ala Lys Gin Arg Val Thr Gly Leu Asp Phe lie Pro Gly Leu His Pro 
35 40 45 

lie Leu Ser Leu Ser Lys Met Asp Gin Thr Leu Ala Val Tyr Gin Gin 

50 '55 60 

Val Leu Thr Ser Leu Pro Ser Gin Asn Val Leu Gin lie Ala Asn Asp 

55 70 75 30; 

Leu Glu Asn Leu Arg Asp Leu Leu His Leu Leu Ala Phe Ser Lys Ser 
85 90 95 

Cys Ser Leu Pro Gin Thr Ser Gly Leu Gin Lys Pro Glu Ser Leu Asp 

100' 105 110 

Gly Val Leu Glu Ala Ser Leu Tyr Ser Thr Glu Val Val Ala Leu Ser 

115 1 120 125 

Arg Leu Gin Gly Ser Leu Gin Asp lie Leu Gin Gin Leu Asp Val Ser 
130 135 140 



Pro Glu Cys 
145 
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(2) INFORMATION FOR SEQ TO NO : 4 : 

(i) SEQUENCE CHARACTERISTICS ■ 

(A) LENGTH: 454 base oairs 
(3) TYPE'; nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 
(3) LOCATION: 4 

(D) OTHER INFORMATION: /note* -Met = ATG' 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



CATATGGTAC CGATCCAGAA AGTTCAGGAC GACACCAAAA CCTTAATTAA 


AACGATCGTT 


60 


ACGCGTATCA ACGACATCAG TCACACCCAG TCGGTGAGCT CTAAACAGCG 


TGTTACAGGC 


120 


CTGGACTTCA TCCCGGGTCT GCACCCGATC CTGACCTTGT CCAAAATGGA 


CCAGACCCTG 


130 


GCTGTATACC AGCAGATCTT AACCTCCATG CCGTCCCGTA ACGTTCTTCA GATCTCTAAC 


240 


GACCTCGAGA ACCTTCGCGA CCTGCTGCAC GTGCTGGCAT TCTCCAAATC 


CTGCCACCTG 


300 


CCATGGGCTT CAGGTCTTGA GACTCTGGAC TCTCTGGGCG GGGTCCTGGA 


AGCATCCGGT 


360 


TACAGCACCG AAGTTGTTGC TCTGTCCCGT CTGCAGGGTT CCCTTCAGGA 


CATGCTTTGG 


420 


CAGCTGGACC TGTCTCCGGG TTGTTAATGG ATCC 




454 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 454 base pairs 
IB) TYPE: nucleic acid 
(Cr STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GTATACCATG GCTAGGTCTT TCAAGTCCTG CTGTGGTTTT GGAATTAATT TTGCTAGCAA 
TGCGCATAGT TGCTGTAGTC AGTGTGGGTC AGCCACTCGA GATTTGTCGC ACAATGTCCG 



60 
120 
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GACCTGAAGT 


AGGGCCCAGA 


CGTGGGCTAG 


GACTGGAACA GGTTTTACCT 


GGTCTGGGAC 


130 


CGACATATGG 


TCGTCTAGAA 


TTGGAGGTAC 


GGCAGGGCAT 


TGCAAGAAGT 


CTAGAGATTG 


240 


CTGGAGCTCT 


TGGAAGCGCT 


GGACGACGTG 


CACGACCGTA 


AGAGGTTTAG 


GACGGTGGAC 


300 


GGTACCCGAA 


GTCCAGAACT 


CTGAGACCTG 


AGAGACCCGC 


CCCAGGACCT 


TCGTAGGCCA 


360 


ATGTCGTGGC 


TTCAACAACG 


AG AC AGGGC A 


GACGTCCCAA 


GGGAAGTCCT 


GTACGAAACC 


420 


GTCGACCTGG 


ACAGAGGCCC 


AACAATTACC 


TAGG 






454 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 147 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



( ix) FEATURE: 

(A) NAME /KEY: Protein 

(B) LOCATION- 1 

•(D) OTHER INFORMATION : /note- "Met (ATG) starts at -1" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Val Pro He Gin Lys Val Gin Asp Asp Thr Lys Thr Leu lie Lya£ 
1 5 10 15 

Thr He- Val Thr Arg He Asn Asp He Ser His Thr Gin Ser Val Ser 

2.0 25 30 

Ser Lys Gin Arg Val Thr Gly Leu Asp Phe He Pro Gly Leu His Pro 
35 40 45 

He Leu Thr Leu Ser Lys Met Asp Gin Thr Leu Ala Val Tyr Gin Gin 

50 55 60 

He Leu Thr Ser Met Pro Ser Arg Asn Val Leu Gin He Ser Asn Asp 
65 70 75 30 

Leu Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe Ser Lys Ser 
85 90 95 

Cys His Leu Pro Trp Ala Ser Gly Leu Glu Thr Leu Asp Ser Leu Gly 
100 105 HO 
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Gly Val Lau Glu Ala 3ar Gly Tyr Ssr rh, 

115 ^ -[*J bar Thr Glu Val Val Ala Leu 3ar 

12 5 

Arg Lau Gin Gly Sar L*u Gin A~r> vc-- , 

130 „J A " P M9C Lsu Trp Gln Lau Asp L 



au Ser 

140 



Pro Gly Cys 
145 



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1150 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME /KEY: misc.f eature 

(B) LOCATION: 4 

(D) OTHER INFORMATION : /note- "Met = ATG" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CATATGGAAC CCAAATCTTG TGACAAAACT CACACATGCC CACCGTGCCC AGCACCTGAA 
CTCCTGGGGG GACCGTCAGT CTTCCTCTTC CCCCCAAAAC CCAAGGACAC CCTCATGATC 
TCCCGGACCC CTGAGGTCAC ATGCGTGGTG GTGGACGTGA GCCACGAAGA CCCTGAGGTC 
AAGTTCAACT GGTACGTGGA CGGCGTGGAG GTGCATAATG CCAAGAGAAA GCCGCGGGAG 
GAGCAGTACA ACAGCACGTA* CCGTGTGGTC AGCGTCCTCA CCGTCCTGCA CCAGGACTGG 
CTGAATGGCA AGGAGTACAA GTGCAAGGTC TCCAACAAAG CCCTCCCAGC CCCCATCGAG 
AAAACCATCT CCAAAGCCAA A^GGCAGCCC CGAGAACCAC AGGTGTACAC CCTGCCCCCA 
TCCCGGGATG AGCTGACCAA GAACCAGGTC AGCCTGACCT GCCTGGTCAA AGGCTTCTAT 
CCCAGCGACA TCGCCGTGGA GTGGGAGAGC AATGGGCAGC CGGAGAACAA CTACAAGACC 
ACGCCTCCCG TGCTGGACTC CGACGGCTCC TTCTTCCTCT ACAGCAAGCT CACCGTGGAC 
AAGAGCAQGT GGCAGCAGGG GAACGTCTTC TCATGCTCCG TGATGCATGA GGCTCTGCAC 
AACCACTACA CGCAGAAGAG CCTCTCCCTG TCTCCGGGTA AAGTACCGAT CCAGAAAGTT 



60 
120 
130 
240 
300 
360 
420 
480 
540 
600 
660 
720 



3 3 



CAGGACG AC A CCAAAACCTT 


AATTAAAACG 


ATCGTTACGC 


GTATCAACGA 


CATCAGTCAC 


730 


ALCCnG i CGG 


7GAGCTCTAA 


ACAGAAAGTT 


ACAGGCCTGG 


ACTTCATCCC 


GGGTCTGCAC 


340 


CCGATCCTGA 


CCTTGTCCAA 


AATGGACCAG 


ACCCTGGCTG 


TATACCAGCA 


GATCTTAACC 


900 


TCCATGCCGT 


CCCGTAACGT 


TATCCAGATC 


TCTAACGACC 


TCGAGAACCT 


TCGCGACCTG 


960 


CTGCACGTGC 


TGGCATTCTC 


CAAATCCTGC 


CACCTGCCAT 


GGGCTTCAGG 


TCTTGAGACT 


1020 


CTGGACTCTC 


TGGGCGGGGT 


CCTGGAAGCA 


TCCGGTTACA 


GCACCGAAGT 


TGTTGCTCTG 


1090 


TCCCGTCTGC 


AGGGTTCCCT 


TCAGGACATG 


CTTTGGCAGC 


TGGACCTGTC 


TCCGGGTTGT 


1140 


TAATGGATCC 












1150 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1150 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 1 cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 



GTATACCTTG 


GGTTTAGAAC 


ACTGTTTTGA 


GTGTGTACGG GTGGCACGGG 


TCGTGGACTT 


50 


GAGGACCCCC 


CTGGCAGTCA 


GAAGGAGAAG 


GGGGGTTTTG GGTTCCTGTG GGAGTACTAG 


• 120 


AGGGCCTGGG 


GACTCCAGTG 


TACGCACCAC 


CACCTGCACT CGGTGCTTCT GGGACTCCAG 


180 


TTCAAGTTGA CCATGCACCT 


GCCGCACCTC 


CACGTATTAC 


GGTTCTGTTT 


CGGCGCCCTC 


240 


CTCGTC ATGT 


TGTCGTGCAT 


GGCACACCAG 


TCGCAGGAGT 


GGCAGGACGT 


GGTCCTGACC 


300 


GACTTACCGT TCCTCATGTT 


CACGTTCCAG 


AGGTTGTTTC 


GGGAGGGTCG 


GGGGT AGCTC 


360 


TTTTGGTAGA 


GGTTTCGGTT 


TCCCGTCGGG 


GCTCTTGGTG 


TCCACATGTG 


GGACGGGGGT 


420 


AGGGCCCTAC 


TCGACTGGTT 


CTTGGTCCAG 


TCGGACTGGA 


CGGACCAGTT 


TCCGAAGATA 


480 


GGGTCGCTGT 


AGCGGCACCT 


CACCCTCTCG 


TTACCCGTCG 


GCCTCTTGTT 


GATGTTCTGG 


540 


TGCGGAGGGC 


ACGACCTGAG 


GCTGCCGAGG 


AAGAAGGAGA 


TGTCGTTCGA 


GTGGCACCTG 


600 


TTCTCGTCCA 


CCGTCGTCCC 


CTTGCAGAAG 


AGTACGAGGC 


ACTACGTACT 


CCGAGACGTG 


560 


TTGGTGATGT 


GCGTCTTCTC 


C- G AG AGGG AC 


AGAGGCCCAT 


TTCATGGCTA 


GGTCTTTCAA 


720 



***** 



.-•rx 



DO 



: 



GTCCTGCTGT GGTTTTGCAA TTAATTTTGC TAGCAATGCG 

.AGCAATGCG CATAGTTGCT GTAGTCAGTG 

r«=Tc AO c= actcgagatt tgtcttga, OTracc ^agtaggg CCCMicGT0 

GGCTAGGAGT GGAAGAGGTT TTACCTGGTC tGGGACCGAC ATATGGTCGT CTAGAATTGG 
AGGTACGCGA GGGCATTGGA ATAGGTCTAG AGATTGCTGG AGCTCTTGGA AGCGCTGGAC 
GACGTGCACG ACCGTAAGAG GTTTAGGACG GTGGACGGTA CCCGAAGTCC AGAACTCTGA 
GACCTGAGAG ACCCGCGGCA GGACCTTCGT AGGGCAATGT CGTGGCTTCA ACAACGAGAC 
AGGGCAGACG TCCCAAGGGA AGTCCTGTAC GAAACCGTCG ACCTGGACAG AGGCCCAACA 
ATTAC CTAGG 



: -d-i 

■■■■>' 



<2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 379 amino acids 
IB) TYPE: amino acid 

(C) STRAND EDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: pro Cain 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1 

(D) OTHER INFORMATION : /note- "Met (ATG) starts at -1- / 
Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Mat Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys Pro 
5 10 15 

Ala Pro. Glu Leu Leu Gly Gly Pro Sar Val Pha Lau Pha Pro Pro Lys 

20 ' 25 30 

?ro Lys Asp Thr Leu Met He Ser Arg Thr Pro Glu Val Thr Cys Val 
35 .40 45 

Val Val Asp Val Ser His Glu Asp Pro Glu Val Lys Pha Asn Trp Tyr 

30 55 60 

Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu 

70 75 ao 
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Oln Tyr isn 3ar Thr Tyr Ar , ,,,, ^ ^ ^ 
Gin Asp Trp Leu Asa Gly L ys Giu ^ 



Thr Val Lau His 
95 



105 



Lys Cys Lys Val 3 



sr Asn Lys 



Ala Leu Pro Ala Pro Ila Glu Lys Thr 
115 



110 



He Ser Lys Ala Lys Gly Gin 
"° 125 
Pro Arg Glu Pro Gin Val Tyr Thr 



130 



135 



Leu Pro Pro Ser Arg Asp Glu Leu 
140 



Thx Cys A „ 01n w s„ L.u Thr cys u» w L „ oly ph . ^ pr , 

155 ISO 
*.r Asp II. Al. vjl 0 1„ Trp clu s.r ^ a iy. oln P „ olu A ,„ A „ 



Tyr Ly, Thr Thr Pro Pro Vol Lou Asp s.r Asp Gly 



185 



175 



Ser Phe Phe Leu 
190 



Tyr S.r Ly. L . u Thr val Asp s . r ^ ^ Asn V.! 



200 



205 



Phe Ser Cys Ser Val 
210 



Met His Glu Ala Leu His Asn. His Tyr Thr Gin 



215 



220 



Lys Ser Leu Ser Leu Ser Pro Gly Ly , Val Pro Qln Ly , ^ ^ 



235 



Asp Asp Thr Lys Thr Leu lie Lys Thr He Val Thr 
He Ser His Thr Gin 



250 



240 



Arg lie Asn Asp 
255 



2 60 LVS ° ln Ly9 Val Thr Le * 

265 



270 



Asp Phe lie fro oly Leu Hi3 Pro ne Leu ^ ^ 



280 28S 

Gin Thr Leu Ala Val Tyr Gin Gin lie Leu Thr Ser Met Pro Ser Arg 

295 300 

3 0? Ila Gln ??* Asn As P Le " Glu Asn Leu Arg Asp Leu Leu 



310 



315 320 
His Val Leu Ala Phe Ser Lys Ser Cys His Leu Pro Trp Ala Ser 



325 



330 



Gly 



335 



Leu Glu Thr Leu Asp Ser Leu Gly Gly Val Leu Glu Ala Ser Gly Tyr 
340 345 



350 



Ser Thr Glu Val Val Ala Leu 



Ser Arg Leu Gin Gly Ser Leu Gin Asp 
355 360 355 
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Met Leu Trp Gin Lau Asp Leu Sar Pro Gly C ys 

J/o 375 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1150 base pairs 
IB) . TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /note- "Mee = ATG" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CATATGGAAC CAAAATCTGC TGACAAAACT CACACATGTC CACCTTGTCC AGCTCCGGAA 
CTCCTGGGGG GTCCTTCAGT CTTCCTCTTC CCCCCAAAAC CCAAGGACAC CCTCATGATC 
TCCCGGACCC CTGAGGTCAC ATGCGTGGTG GTGGACGTGA GCCACGAAGA CCCTGAGGTC 
AAGTTCAACT GGTACGTGGA CGGCGTGGAG GTGCATAATG CCAAGACAAA GCCGCGGGAG 
GAGCAGTACA ACAGCACGTA CCGTGTGGTC AGCGTCCTCA CCGTCCTGCA CCAGGACTGG 
CTGAATGGCA AGGAGTACAA GTGCAAGGTC TCCAACAAAG CCCTCCCAGC CCCCATCGAG 
AAAACCATCT CCAAAGCCAA AGGGCAGCCC CGAGAACCAC AGGTGTACAC CCTGCCCCCA 
TCCCGGGATG AGCTGACCAA GAACCAGGTC AGCCTGACCT GCCTGGTCAA AGGCTTCTAT 
CCCAGCGACA TCGCCGTGGA GTGGGAGAGC AATGGGCAGC CGGAGAACAA CTACAAGACC 
ACGCCTCCCG TGCTGGACTC CGACGGCTCC TTCTTCCTCT ACAGCAAGCT CACCGTGGAC 
AAGAGCAGGT GGCAGCAGGG GAACGTCTTC TCATGCTCCG TGATGCATGA GGCTCTGCAC 
AACCACTACA CGCAGAAGAG CCTCTCCCTG TCTCCGGGTA AAGTACCGAT CCAGAAAGTT 
CAGGACGACA CCAAAACCTT AATTAAAACG ATCGTTACGC GTATCAACGA CATCAGTCAC 
ACCCAGTCGG TGAGCTCTAA ACAGAAAGTT ACAGGCCTGG ACTTCATCCC GGGTCTGCAC 
CCGATCCTGA CCTTGTCCAA AATGGACCAG ACCCTGGCTG TATACCAGCA GATCTTAACC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
730 
340 
900 
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TCCATGCCGT CCCGTAACGT ^ATrrirA^ ™ 

"COT .ATCCAOATC TCTAACGACC TCGAGAACCT TCGCGACCTG 

CTGCACGTGC tGGCATTCTC CAAATCCGC CACCTGCCA, GGGCTTCAGG TCTTGAGACT 
CTGGACTCTC TGGGCGGGGT CCTGG«GCA TCCGGTTACA GCACCGAACT TGTTGCTCTG 
TCCCGTCTCC AGGGTTCCCT TCAGGACATG CTTTGGCAGC TGGACCTGTC TCCGGGTTGT 
TAATGGATCC 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 1150 base oairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GTATACCTTG GTTTTAGACG ACTGTTTTGA GTGTGTACAG GTGGAACAGG TCGAGGCCTT 
GAGGACCCCC CAGGAAGTCA GAAGGAGAAG GGGGGTTTTG GGTTCCTGTG GGAGTACTAG 
AGGGCCTGGG GACTCCAGTG TACGCACCAC CACCTGCACT CGGTGCTTCT GGGACTCCAG 
TTCAAGTTGA CCATGCACCT GCCGCACCTC CACGTATTAC GGTTCTGTTT CGGCGCCCTC 
CTCGTCATGT TGTCGTGCAT GGCACACCAG TCGCAGGAGT GGCAGGACGT GGTCCTGACC 
GACTTACCGT TCCTCATGTT CACGTTCCAG AGGTTGTTTC GGGAGGGTCG GGGGTAGCTC 
TTTTGGTAGA GGTTTCGGTT TCCCGTCGGG GCTCTTGGTG TCCACATGTG GGACGGGGGT 
AGGGCCCTAC TCGACTGGTT CTTGGTCCAG TCGGACTGGA CGGACCAGTT TCCGAAGATA 
GGGTCGCTGT AGCGGCACCT CACCCTCTCG TTACCCGTCG GCCTCTTGTT GATGTTCTGG 
TGCGGAGGGC ACGACCTGAG GCTGCCGAGG AAGAAGGAGA TGTCGTTCGA GTGGCACCTG 
TTCTCGTCCA CCGTCGTCCC CTTGCAGAAG AGTACGAGGC ACTACGTACT CCGAGACGTG 
TTGGTGATGT GCGTCTTCTC GGAGAGGGAC AGAGGCCCAT TTCATGGCTA GGTCTTTCAA 
GTCCTGCTGT GGTTTTGGAA TTAATTTTGC TAGCAATGCG CATAGTTGCT GTAGTCAGTG 
TGCGTCAGCC ACTCGAGATT TGTCTTTCAA TGTCCGGACC TGAAGTAGGG CCCAGACGTG 



950 
1020 
1030 
1140 
1150 



'60 
120 
130 
240 
' 300 
360 
420 
480 
540 
600 
660 
720 
780 
340 
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TTACCTGGTC 


TGGGACCGAC 


ATATGGTCGT 




900 


ATAGGTCTAG 


AGATTGCTW 


AGCTCTTGGA 


AGCGCTGGAC 


950 


GTTTAGGACG 


GTGGACGGTA 


CCCGAAGTCC 


AGAACTCTGA 


1020 


GGACCTTCGT 


AGGCCAATGT 


CGTGGCTTCA 


ACAACGAGAC 


1030 


AGTCCTGTAC 


GAAACCGTCG 


ACCTGGACAG 


AGGCCCAACA 


1140 
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ATTACCTAGG 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH : 379 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME /KEY: Protein 
(3) LOCATION: 1 

CD) OTHER INFORMATION : /note- "Met (ATG) starts at -1- 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Glu Pro Lys Ser Ala Asp Lys Thr His Thr Cys Pro Pro Cys Pro 

10 1S 

Ala Pro Glu Leu Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys 



25 



30 



?ro Lys Asp Thr Leu Met lie Ser Arg Thr Pro Glu Val Thr Cys Val 



40 



45 



</al Val Asp val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr 

35 60 

Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu 

70 75 so 

Ola Tyr Asn Ser Vhr Tyr Arg Val Val Ser Val Leu Thr Val Leu His 
35 90 95 

Oln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys 



n 100 ios no 



Ala Leu Pro Ala Pro lie Glu Lys Thr lie Ser Lys Ala Lys Gly Gin 
115 "0 125 

Pro Arg Glu Pro Gin Val Tyr Thr Leu Pro Pro Ser Arg Asp Glu Leu 



135 



140 



Thr Lys Asn Gin Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr 
145 15 ° 155 * lio 

Ser Asp lie Ala Val Glu Trp Glu Ser Asn Gly Gin Pro Glu Asn Asn 
165 170 I75 

Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu 
130 135 190 

Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gin Gin Gly Asn Val 
195 200 205 

Phe Ser Cys Ser Val Mec His Glu Ala Leu His Asn His Tyr Thr Gin 
210 215 220 

Lys Ser Leu Ser Leu Ser Pro Gly Lys Val Pro He Gin Lys Val Gin 

225 230 235 240 

Asp Asp Thr Lys Thr Leu lie Lys Thr He Val Thr Arg He Asn Asp 
245 250 " 255 

He Ser His Thr Gin Ser Val Ser Ser Lys Gin Lys Val Thr Gly Leu 
260 265 270 

Asp Phe He Pro Gly Leu His Pro He Leu Thr Leu Ser Lys Met Asp 
275 280 235 

Gin Thr Leu Ala Val Tyr Gin Gin He Leu Thr Ser Met Pro Ser Ar'g 
290 295 300 

Asn Val He Gin He Ser Asn Asp Leu Glu Asn Leu Arg Asp Leu Leu 
305 310 315 320 

His Val Leu Ala Phe Ser Lys Ser Cys His Leu Pro Trp Ala Ser Gly 

325 330 335 

Leu Glu Thr Leu Asp Ser Leu Gly Gly Val Leu Glu Ala Ser Gly Tyr 
340 345 350 

Ser Thr Glu Val Val Ala Leu Ser Arg Leu Gin Gly Ser Leu Gin Asp 
355 360 365 



Met Leu Trp Gin Leu Asp Leu Ser Pro Gly Cys 

370 375 



(2) INFORMATION FOR ScQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: L135 ba 39 oairs 
(3) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feacure 

(B) LOCATION: 4 

(D) OTHER INFORMATION : /noce» "Mec - ATG' 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CATATGGACA AAACTCACAC ATGTCCACCT TGTCCAGCTC CGGAACTCCT GGGGGGTCCT 
TCAGTCTTCC TCTTCCCCCC AAAACCCAAG GACACCCTCA TGATCTCCCG GACCCCTGAG 
GTCACATGCG TGGTGGTGGA CGTGAGCCAC GAAGACCCTG AGGTCAAGTT CAACTGGTAC 
GTGGACGGCG TGGAGGTGCA TAATGCCAAG ACAAAGCCGC GGGAGGAGCA GTACAACAGC 
ACGTACCGTG TGGTCAGCGT CCTCACCGTC CTGCACCAGG ACTGGCTGAA TGGCAAGGAG 
TACAAGTGCA AGGTCTCCAA CAAAGCCCTC CCAGCCCCCA TCGAGAAAAC CATCTCCAAA 
GCCAAAGGGC AGCCCCGAGA ACCACAGGTG TACACCCTGC CCCCATCCCG GGATGAGCTG 
ACCAAGAACC AGGTCAGCCT GACCTGCCTO GTCAAAGGCT TCTATCCCAG CGACATCGCC 
GTGGAGTGGG AGAGCAATGG GCAGCCGGAG AACAACTACA AGACCACGCC TCCCGTGCTG 
GACTCCGACG GCTCCTTCTT CCTCTACAGC AAGCTCACCG TGGACAAGAG CAGGTGGCAG 
CAGGGGAACG TCTTCTCATG CTCCGTGATG CATGAGGCTC TGCACAACCA CTACACGCAG 
AAGAGCCTCT CCCTGTCTCC GGGTAAAGTA CCGATCCAGA AAGTTCAGGA CGACACCAAA 
ACCTTAATTA AAACGATCGT TACGCGTATC AACGACATCA GTCACACCCA GTCGGTGAGC 
TCTAAACAGA AAGTTACAGG CCTGGACTTC ATCCCGGGTC TGCACCCGAT CCTGACCTTG 
TCCAAAATGG ACCAGACCCT GGCTGTATAC CAGCAGATCT TAACCTCCAT GCCGTCCCGT 
AACGTTATCC AGATCTCTAA CGACCTCGAG AACCTTCGCG ACCTGCTGCA CGTGCTGGCA 
TTCTCCAAAT CCTGCCACCT GCCATGGGCT TCAGGTCTTG AGACTCTGGA CTCTCTGGGC 
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GGGCTCCTCG AAGCATCCGG TTACAGCACC GAAGTTGTTG CTCTGTCCCG TCTGCAGGGT I0 30 
TCCCTTCAGG ACATCCTTTG GCAGCTGGAC CTGTCTCCGG GTTGTTAATG GATCC U35 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1135 base pair 3 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GTATACCTGT TTTGAGTGTG TACAGGTGGA ACAGGTCGAG GCCTTGAGGA CCCCCCAGGA 
AGTCAGAAGG AGAAGGGGGG TTTTGGGTTC CTGTGGGAGT ACTAGAGGGC CTGGGGACTC 
CAGTGTACGC ACCACCACCT GCACTCGGTG CTTCTGGGAC TCCAGTTCAA GTTGACCATG 
CACCTGCCGC ACCTCCACGT ATTACGGTTC TGTTTCGGCG CCCTCCTCGT CATGTTGTCG 
TGCATGGCAC ACCAGTCGCA GGAGTGGCAG GACGTGGTCC TGACCGACTT ACCGTTCCTC 
atgttcacgt' TCCAGAGGTT GTTTCGGGAO GGTCGGGGGT AGCTCTTTTG GTAGAGGTTT 
CGGTTTCCCG TCGGGGCTCT TGGTGTCCAC ATGTGGGACG GGGGTAGGGC CCTACTCGAC 420 
TGGTTCTTGG TCCAGTCGGA CTGGACGGAC CAGTTTCCGA AGATAGGGTC GCTGTAGCGG f 480 
CACCTCACCC TCTCGTTACC CGTCGGCCTC TTGTTGATGT TCTGGTGCGG AGGGCACGAC 540 
CTGAGGCTGC CGAGGAAGAA GGAGATGTCG TTCGAGTGGC ACCTGTTCTC GTCCACCGTC 600 
GTCCCCTTGC AGAAGAGTAC' GAGGCACTAC GTACTCCGAG ACGTGTTGGT GATGTGCGTC 660 
| TTCTCGGAGA -GGGACAGAGG CCCATTTCAT GGCTAGGTCT TTCAAGTCCT GC TGTGGTTT 720 

TGGAATTAAT TTTGCTAGCA ATGCGCATAG TTGCTGTAGT CAGTGTGGGT CAGCCACTCG 7 30 

AGATTTGTCT TTCAATGTCC GGACCTGAAG TAGGGCCCAG ACGTGGGCTA GGACTGGAAC 840 
AGGTTTTACC TGGTCTGGGA CCGACATATG GTCGTCTAGA ATTGGAGGTA CGGCAGGGCA 900 
TTGCAATAGG TCTAGAGATT GCTGGAGCTC TTGGAAGCGC TGGACGACGT GCACGACCGT 9 60 

AAGAGGTTTA GGACGGTGGA CGGTACCCGA AGTCCAGAAC TCTGAGACCT GAGAGACCCG 1020 
CCCCAGGACC TTCGTAGGCC AATGTCGTGG CTTCAACAAC GAGACAGGGC AGACGTCCCA 1080 



60 
120 
13Q 
240 
300 
360 
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AGGGAAGTCC TGTACGAAAC CGTCGACCTG GACAGAGGCC CAACAATTAC CTAGG 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 374 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/ KEY: Protein 
<B) LOCATION: 1 

(D) OTHER INFORMATION: /notes -Met (ATG) starts at 
(xt) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Met Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu 

1 5 10 is 

Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu 
20 25 30 

Met He Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser 
35 40 45 

His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu 
50 55 60 

Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gin Tyr Asn Ser Thr 

65 70 75 30 

Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gin Asp Trp Leu Asn 
35 90 95 

Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro 
100 105 110 

He Glu Lys Thr He Ser Lys Ala Lys Gly Gin Pro Arg Glu Pro Gin 

.115 120 125 

Val Tyr Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gin Val 
130 135 140 

Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp He Ala Val 

145 150 " 155 160 



1135 



O 3 



Glu Trp Glu 5sr Asn Gly G'n Pro ri„ a 

im Pro Glu Asn Asn Tyr Lys Thr Thr Pro 

Pro Val Leu Asp Ser Asp Gly Ser pha 

i-sO 1aa u iy * 33 r Lys Lau Thr 

135 190 



v.i ... s„ 4cg Tr5 0ln 0ly A „ vai ?h> s 

<i00 205 
Met His Glu Ala Leu His Asn « ( , 

210 HlS Hls Thr Gin Lya Ser Leu Ser Leu 

220 

S« Pro Gly Lys Vll tl . oln Ly> ^ ^ ^ ^ ^ 

235 240 
L~ U. , y . Thr val Thr Acj u . A , p „, 

250 2S5 
Ser Val Ser Ser Ly, Gin Lys Val Thr Gly Leu Asp Ph . X1 . Pro Gly 



265 270 



Leu His Pro lie Leu Thr Leu Ser Lvs Met a-« «w 

275 yS MeC A3 *> Gln Thr Leu Ala Val 

280 285 

tyr sin <u„ n. t . u ^ s M . e Pco sar ^ &>n vsi n# 

295 300 



Ser Asn Asp Leu Glu Asn Leu 



Arg Asp Leu Leu His Val Leu 



3 05 . 31Q * |T eu Bls v «l Leu Ala Phe 



315 



320 



Ser Lys Ser Cys Hi, Leu Pro Trp Ala Ser Gly Leu Glu Thr Leu Asp 

325 330 335 

3er Leu Gly Gly Val Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Va*' 

34S 3S0 

Ala Leu Ser Arg Leu Gin Gly Ser Leu Gin Asp Met Leu Trp Gin Leu 
35 36 ° 3S5 

Asp Leu Ser Pro Gly Cys 

370 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1135 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



I 
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(ix) FEATURE: 

(A) NAME /KEY: misc.caacure 
(3) LOCATION: 4 

(D) OTHER INFORMATION: /notaa "Mat = ATG* 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

CATATGGACA AAACTCACAC ATGCCCACCG TGCCCAGCTC CGGAACTCGA AGGTGGTCCG 50 

TCAGTCTTCC TCTTCCCCCC AAAACCCAAG GACACCCTCA TGATCTCCCG GACCCCTGAG 120 

GTCACATGCG TGGTGGTGGA CGTGAGCCAC GAAGACCCTG AGGTCAAGTT CAACTGGTAC 180 

GTGGACGGCG TGGAGGTGCA TAATGCCAAG ACAAAGCCGC GGGAGGAGCA GTACAACAGC 2 40 

ACGTACCGTG TGGTCAGCGT CCTCACCGTC CTGCACCAGG ACTGGCTGAA TGGCAAAGCT 300 * * 

TACGCATGCG CGGTCTCCAA CAAAGCCCTC CCAGCCCCCA TCGAGAAAAC CATCTCCAAA 3 60 

GCCAAAGGGC AGCCCCGAGA ACCACAGGTG TACACCCTGC CCCCATCCCG GGATGAGCTG 420 

ACCAAGAACC AGGTCAGCCT GACCTGCCTG GTCAAAGGCT TCtATCCCAG CGACATCGCC 430 

GTGGAGTGGG AGAGCAATGG GCAGCCGGAG AACAACTACA AGACCACGCC TCCCGTGCTG 540 

GACTCCGACG GCTCCTTCTT CCTCTACAGC AAGCTCACCG TGGACAAGAG CAGGTGGCAG 600 

CAGGGGAACG TCTTCTCATG CTCCGTGATG CATGAGGCTC TGCACAACCA CTACACGCAG 660 

AAGAGCCTCT CCCTGTCTCC GGGTAAAGTA CCGATCCAGA AAGTTCAGGA CGACACCAAA 720 
ACCTTAATTA AAACGATCGT TACGCGTATC AACGACATCA GTCACACCCA GTCGGTGAGC ■' 730 

TCTAAACAGA AAGTTACAGG CCTGGACTTC ATCCCGGGTC TGCACCCGAT CCTGACCTTG 3 40 

TCCAAAATGG ACCAGACCCT GGCTGTATAC CAGCAGATCT TAACCTCCAT GCCGTCCCGT 900 

AACGTTATCC AGATCTCTAA CGACCTCGAG AACCTTCGCG ACCTGCTGCA CGTGCTGGCA 960 

TTCTCCAAAT CCTGCCACCT 1 GCCATGGGCT TCAGGTCTTG AGACTCTGGA CTCTCTGGGC 1020 

GGGGTCCTGG AAGCATCCGG TTACAGCACC GAAGTTGTTG CTCTGTCCCG TCTGCAGGGT 1080 

TCCCTTCAGG ACATGCTTTG GCAGCTGGAC CTGTCTCCGG GTTGTTAATG GATCC 113 5 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 1135 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cONA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GTATACCTGT TTTGAGTGTG TACGGGTGGC ACGGGTCGAG GCCTTGAGCT TCCACCAGGC 
AGTCAGAAGG AGAAGGGGGG TTTTGGGTTC CTGTGGGAGT ACTAGAGGGC CTGGGGACTC 
CAGTGTACGC ACCACCACCT GCACTCGGTG CTTCTGGGAC TCCAGTTCAA GTTGACCATG 
CACCTGCCGC ACCTCCACGT ATTACGGTTC TGTTTCGGCG CCCTCCTCGT CATGTTGTCG 
TGCATGGCAC ACCAGTCGCA GGAGTGGCAG GACGTGGTCC TGACCGACTT ACCGTTTCGA 
ATGCGTACGC GCCAGAGGTT GTTTCGGGAG GGTCGGGGGT AGCTCTTTTG GTAGAGGTTT 
CGGTTTCCCG TCGGGGCTCT TGGTGTCCAC ATGTGGGACG GGGGTAGGGC CCTACTCGAC 
TGGTTCTTGG TCCAGTCGGA CTGGACGGAC CAGTTTCCGA AGATAGGGTC GCTGTAGCGG 
CACCTCACCC TCTCGTTACC CGTCGGCCTC TTGTTGATGT TCTGGTGCGG AGGGCACGAC 
CTGAGGCTGC CGAGGAAGAA GGAGATGTCG TTCGAGTGGC ACCTGTTCTC GTCCACCGTC 
OTCCCCTTGC AGAAGAGTAC GAGGCACTAC GTACTCCGAG ACGTGTTGGT GATGTGCGTC 
TTCTCGGAGA GGGACAGAGG CCCATTTCAT GGCTAGGTCT TTCAAGTCCT GCTGTGGTTT 
TGGAATTAAT TTTGCTAGCA ATGCGCATAG TTGCTGTAGT CAGTGTGGGT CAGCCACTCG 
AGATTTGTCT TTCAATGTCC GGACCTGAAG TAGGGCCCAG ACGTGGGCTA GGACTGGAAC 
AGGTTTTACC TGGTCTGGGA CCGACATATG GTCGTCTAGA ATTGGAGGTA CGGCAGGGCA 
TTGCAATAGG TCTAGAGATT GCTGGAGCTC TTGGAAGCGC TGGACGACGT GCACGACCGT 
AAGAGGTTTA GGACGGTGGA CGGTACCCGA AGTCCAGAAC TCTGAGACCT GAGAGACCCG 
CCCCAGGACC TTCGTAGGCC AATGTCGTGG CTTCAACAAC GAGACAGGGC AGACGTCCCA 
AGGGAAGTCC TGTACGAAAC CGTCGACCTG GACAGAGGCC CAACAATTAC CTAGG 



60 
120 
180 *'• 



240 

300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1030 
1135 



INFORMATION FOR 3SQ ID NO: 18: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 74 amino acids 

(B) TYPE: amino acid 

. (C) STRANDEDNES5 : unknown 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/ KEY: Protein 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /note- "Met (ATG) starts at -1" 
(XX) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu Glu 

1 5 10 15 

Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu 

20 25 30 

Met lie Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser 
35 40 45 

His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu 
50 55 60 

Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gin Tyr Asn Ser Thr, 
65 70 75 30 ; 

Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gin Asp Trp Leu Asn 

85 90 95 

Gly Lys Ala Tyr Ala Cys Ala Val Ser Asn Lys Ala Leu Pro Ala Pro 

100 105 110 

lie Glu" Lys Thr lie Ser Lys Ala Lys Gly Gin Pro Arg Glu Pro Gin 
115 120 125 

Val Tyr Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gin Val 
130 135 140 

Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp He Ala Val 
145 150 155 160 

Glu Trp Glu Ser Asn Gly Gin Pro Glu Asn Asn Tyr Lys Thr Thr Pro 
165 170 175 
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Pro Val Leu Asp Ser Asp Gly Ser Phe. Phe Leu Tyr Ser Lys Leu Thr 
180 '-3S L90 

Val Asp Lys Ser Arg Trp Gin Gin Gly Asn Val Phe Ser Cys Ser Val 
195 200 205 

Mec His Glu Ala Leu His Asn His Tyr Thr Gin Lys Ser Leu Ser Leu 
210 215 220 

Ser Pro Gly Lys Val Pro lie Gin Lys Val Gin Asp Asp Thr Lys Thr 
225 230 235 240 

Leu He Lys Thr He Val Thr Arg lie Asn Asp Ila Ser His Thr Gin 

250 255 

Ser Val Ser Ser Lys Gin Lys Val Thr Gly Leu Asp Phe lie Pro Gly 
2 «0 265 270 

Leu His Pro He Leu Leu Ser Lys Met Asp Gin Thr Leu Ala Val 

27S 280 285 

Tyr Gin Gin lie Leu Thr Ser Mec Pro Ser Arg Asn Val lie Gin lie 

290 295 300 

Ser Asn Asp Leu Glu Asn Leu Arg Asp Leu Leu His Val Leu Ala Phe 

3 °S 310 315 320 

Ser Lys Ser Cys His Leu Pro Trp Ala Ser Gly Leu Glu Thr Leu Asp 
325 330 335 

Ser Leu Gly Gly Vai Leu Glu Ala Ser Gly Tyr Ser Thr Glu Val Val 
340 345 350 

Ala Leu Ser Arg Leu Gin Gly Ser Leu Gin Asp Met Leu Trp Gin Leu 

355 360 365 



A3p Leu Ser Pro Gly Cys 
370 



